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Large antenna arrays at the transmitter (TX) has recently been shown to achieve remarkable intra-cell 
lyj \ interference suppression at low complexity. However, building large arrays in practice, would require the use 

of power-efficient RF amplifiers, which generally have poor linearity characteristics and hence would require the 
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use of input signals with a very small peak-to-average power ratio (PAPR). In this paper, we consider the single- 
user Multiple-Input Single-Output (MISO) channel for the case where the TX antennas are constrained to transmit 
signals having constant envelope (CE). We show that, with per-antenna CE transmission the effective channel seen 
by the receiver is a SISO AWGN channel with its input constrained to lie in a doughnut-shaped region. For a 
broad class of fading channels, analysis of the effective doughnut channel shows that under a per-antenna CE input 
constraint, i) compared to an average-only total TX power constrained MISO channel, the extra total TX power 
■ required to achieve a desired information rate is small and bounded, ii) with N TX antennas an 0{N) array power 

5t ■ gain is achievable, and iii) for a desired information rate, using power-efficient amplifiers with CE inputs would 

require significantly less total TX power when compared to using highly linear (power-inefficient) amplifiers with 
high PAPR inputs. 
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I. Introduction 

The high electrical power consumption in cellular base stations (BS) has been recognized as a major 
problem worldwide One way of reducing the power consumed is to reduce the total radiated radio- 
frequency (RF) power. In theory, the total radiated power from a BS can be reduced without affecting the 
downlink throughput, by increasing the number of antennas. This effect has been traditionally referred to as 
the "array power gain" In addition to improving power-efficiency, there has been a great deal of recent 
interest in multi-user Multiple-Input Multiple-Output (MIMO) systems with large antenna arrays |[3l, [|4ll, 
due to their ability to substantially reduce intra-cell interference with very simple signal processing. In 
general, multiple antenna beamforming is a well known technology to improve link performance 

To illustrate the improvement in power efficiency with large antenna arrays, let us consider a MISO 
channel between a transmitter (TX) having > 1 antennas and a single-antenna receiver. With knowledge 
of the channel vector (h = {hi,h2,--- , /iAr)^) at the TX and an average-only total transmit power 
constraint of Pt, an information symbol u (with mean energy IE[|Mp] = 1) can be beamformed in such 
a way (the i-th antenna transmits -y/P^/ilu/HhHJ that the signals from different TX antennas add up 
coherently at the receiver (the received signal is -y/^llhll^n), thereby resulting in an effective channel 
with a received signal power that is ||h||^/|/iip times higher compared to a scenario where the TX has 
only one antenna. For a broad class of fading channels (e.g., i.i.d. fading, single-path direct-line-of-sight 
(DLOS)) II h II 2 = \hi\'^0{N), and therefore, for a fixed desired received signal power, the total transmit 
power can be reduced by roughly half with every doubling of the number of TX antennas. This type of 



beamforming is referred to as "Maximum Ratio Transmission" (MKT) (see Fig. |l(a)[ ). 

In theory, to achieve an order of magnitude reduction in the total radiated power (without affecting 
throughput) we would need TX with a large number of antennas (by large, we mean tens or even hundreds 
S, [HI). However, building very large arrays in practice requires that each individual antenna, and its 
associated RF electronics, be cheaply manufactured and implemented in power-efficient technology. It is 
known that conventional BS are highly power-inefficient. Typically, the ratio of radiated power to the total 
power consumed is less than 5 percent, the main reason being the use of highly linear and power-inefficient 
analog devices like the power amplifier [6]!^ Generally, high linearity implies low power efficiency and 

^In a conventional BS, about 40 — 50 percent of the total operational power is consumed by the power amplifier and the associated RF 
electronics, which have a power efficiency of only about 5 — 10 percent |[6). 
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vice-versa. Therefore, non-linear but highly power-efficient amplifiers must be used. With non-linear power 
amplifiers, the signal transmitted from each antenna must have a low peak-to-average-power-ratio, so as 
to avoid significant signal distortion. The type of signal that facilitates the use of the most power-efficient 
and cheap power amplifiers/analog components is therefore a constant envelope (CE) signal. 

With this motivation, in this paper, we consider a single-user Gaussian MISO fading channel with the 
signal transmitted from each TX antenna constrained to have a constant envelope. Fig. |l(b)| illustrates 
the proposed signal transmission under a per-antenna CE constraint. Essentially, for a given information 
symbol u to be communicated to the single-antenna receiver, the signal transmitted from the i-ih. antenna 
is PT/Ne^^"^ . The transmitted phase angles (6'", ■ ■ ■ , 9"^) are determined in such a way that the noise- 
free signal received matches closely with u. The amplitude of the signal transmitted from each antenna 
is constant and equal to ^JP^/N for every channel-use, irrespective of the channel realization. By way 
of contrast, with MRT, the amplitude of the transmitted signal depends upon the channel realization as 
well as on u, and can vary from to \/Pr\u\. Since the CE constraint is much more restrictive than 
the average-only total power constraint in MRT, a natural question which arises now is how much array 
power gain can be achieved with the per-antenna CE constraint. Also, compared to MRT, how much extra 
total transmit power is required with per-antenna CE transmission to achieve a given information rate? 

So far, in the open literature, these questions have not been addressed. For the special case of = 1 
(SISO AWGN), the channel capacity under a CE input constraint has been reported in [|7]|. However for 
A^ > 1, known reported works on per-antenna power constrained communication consider an average- 
only or peak-only power constraint [|8l- [[T3l . For the single-user scenario, in |[8l, the author considers the 
problem of finding the optimal transmit and receive matrices which maximize the received signal-to-noise- 
and-interference-ratio (SINR) in a MIMO channel, subject to a per-antenna average power constraint at 
the TX. In |l9l, the author has derived a closed-form expression for the capacity of a single-user MISO 
channel with a per-antenna average power constraint at the TX. In ifTOl . the authors compute bounds on 
the capacity of a noncoherent single-user MIMO channel with peak per-antenna power constraints at the 
TX. 

For the multiuser MIMO broadcast channel with per-antenna power constraints, in [[TTll the authors 
consider minimization of the per-antenna average power radiated by the transmitter subject to a minimum 
SINR constraint for each user in the downlink. They propose efficient numerical methods for solving this 
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problem using uplink-downlink duality. In [fT2l . the authors study the optimal multi-user linear zero-forcing 
beamformer which maximizes the minimum information rate to the downlink users, under per-antenna 
average power constraints at the BS. In [[T3l . the authors consider the scenario where users can also have 
multiple antennas, and propose methods to maximize the weighted sum-rate and max-min rate under a 
per-antenna average power constraint at the BS. 

In contrast to the above works on per-antenna average/peak power constrained communication, in this 
paper, we consider the more stringent per-antenna constant-envelope constraint where each antenna emits 
a signal of constant amplitude y^Pr/N. 

The specific contributions presented in this paper are: i) we show that, under a per-antenna CE constraint 
at the TX, the MISO channel reduces to a SISO AWGN channel with the noise-free received signal being 
constrained to lie in a "doughnut" shaped region in the complex plane, ii) using the equivalent doughnut 
channel model, we derive analytical upper and lower bounds on the MISO channel capacity under per- 
antenna CE transmission, iii) under per-antenna CE transmission, for large we show that the optimal 
information alphabet (in terms of achieving capacity) is discrete-in- amplitude and uniform-in-phase, and 
iv) we also propose novel algorithms for transmit precoding under the per-antenna CE constraint. Our 
analysis shows that for a large class of fading channels (i.i.d. Rayleigh fading, i.i.d. fading channels where 
the channel gains are boundej^ DLOS), i) under the per-antenna CE constraint, an array power gain of 
0{N) is indeed achievable with N antennas, ii) for a desired information rate to be achieved, compared 
to the MRT precoder with an average-only total transmit power constraint, the extra total transmit power 
required under per-antenna CE transmission is small and bounded, iii) by using a sufficiently large antenna 
array, at high total transmit power Pt, the ratio of the information rate achieved under the per-antenna 
CE constraint to the capacity of the average-only total transmit power constrained MISO channel can be 
guaranteed to be close to 1, with high probability. This stands in contrast to Wyner's result in |I71 for 

= 1, where this ratio is only 1/2 at high Pt. Analytical results are supported with numerical results 
for the i.i.d. Rayleigh fading channel. The analysis and algorithms presented are general and applicable 
to systems with any number of transmit antennas. 

Notation: C and R denote the set of complex and real numbers. \x\, x* and arg(x) denote the absolute 
value, complex conjugate and argument of x G C respectively. For any positivep > 1, ||h||p = |/ij|^)^/^ 

^In practice, real-world channels generally have bounded channel gains. 
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denotes the Euclidean p-norm of h = (hi, ■ ■ ■ , h^) G C^. E[-] denotes the expectation operator, log(-) 
denotes the natural logarithm, and log2(-) denotes the base-2 logarithm. Abbreviations: r.v. (random 
variable), bpcu (bits -per-channel- use), p.d.f. (probability density function). 



II. System model 

We consider a single-user MISO system. The complex channel gain between the ?-th transmit 
antenna and the single antenna receiver is denoted by hi, and the total channel vector is denoted by 
h = [hi, h2, - ■ ■ , hj^Y . TX is assumed to have perfect knowledgecpf h, whereas the receiver is required 
to have only partial knowledge (we shall discuss this later in more detail). Let the complex symbol 
transmitted from the ^-th antenna be denoted by Xt. The complex symbol received is 

N 

y = ^hiXi + w (1) 

i=l 

where w denotes the circularly symmetric AWGN having mean zero and variance cr^, i.e., CJ\f{0,a'^). 
Due to the CE constraint on each antenna and assuming a total transmit power constraint of Pt, we must 
have = Pt/N , i = 1, . . . , N. Therefore Xi must be of the form 

Xi = ^e^'^ , t = l,2,...,N (2) 

where j = and 9i e [— vr , vr) is the phase of Xi. We refer to the type of signal transmission in Q 

as "CE transmission". Note that under an average-only total transmit power constraint, the transmitted 
signals are only required to satisfy IE[X]i kiP] = Pt, which is much less restrictive than Q. Under CE 
transmission, the signal received is given by (using ([T]) and Q) 

N 

N 



y = \j^Y.^^e''' + '^■ (3) 

i=l 



Let Q = {6i,02, - ■ ■ , OnY denote the vector of transmitted phase angles and let m G W C C denote the 
information symbol to be communicated to the receiver, where U is the information symbol alphabet. For 
a given u, the precoder in the transmitter uses a map $(■) : U — )■ [— vr, tx)^ to generate the transmit phase 

'^For large A'^, with Time-Division-Duplex (TDD) communication and assuming a reciprocal channel, channel measurements at the TX 
using reverse link pilot signals can be used to estimate the forward channel. A preliminary study done by us reveals that, the performance of 
the proposed CE transmission scheme degrades with increasing estimation error variance. However, interestingly, with i.i.d. Rayleigh fading 
the performance loss is small even when the standard deviation of the estimation error is of the same order as the average channel gain. 
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angle vector, i.e., = Let the set of possible noise-free received signals scaled down by ^fPr, i.e., 
i Eii be given by 

(4) 



M[\^) = { ^'=^Z , ^.G [-7r,7r) z = l,...,iV 



N 



e]^) such thalE 



By choosing U. C A^(h), for any u eU, it is implied that u E A^j^h), and therefore from dH) it follows 
that, there exists a phase angle vector 6" = {t 

With the precoder map 



(5) 



where 9" satisfies ([5]), the received signal is given by 

y = ^/Pru + w 



(6) 



(7) 



i.e., the noise-free received signal is the same as the intended information symbol u scaled up by \/Pt- 
Subsequently in this paper, we propose to choose U C A^(h)P and define the precoder map as in ^ and 
With U C A^(h) it is clear that the information rate depends on A^(h). In the next section, we give 
a more detailed characterization of A^(h). 



III. Characterization OF A^(h) 

We characterize A^(h) through a series of intermediate results. First, we define the maximum and 
minimum absolute value of any complex number in A^(h). 



M(h) 



max 

0={ei,-,ejv),9«e[-7r,7r) 



E*=i hie- 



N 



, m(h) 



mm 

G=(6(i,-,ejv),e,G[-7r,7r) 



N 



(8) 



Lemma 1: If z e A^(h) then so does ze^'f' for all (j) G [— vr, tt). 

C 7V/((h) implies tliat tlie information symbol alphabet must be chosen adaptively with h. Therefore the receiver must be informed 

about the newly chosen W, every time it changes. However, we shall see in Section |Ill] that the set A4{h) is the interior of a "doughnut" 

shaped region in the 2-dimensional complex-plane and can therefore be fully characterized with only two non-negative real numbers (the 

inner and the outer radius of the doughnut). Hence, the TX only needs to inform the receiver about these two numbers every time h changes. 
*For U % M(\\), it may be possible to consider a precoder map which satisfies (|5} for u G 7V/f(h), and for any it ^ 7V/f(h) finds 

the phase angle vector which minimizes the non-zero energy of the residual/error term ( '-^^ u \ . However, even with such an 

error-minimizing precoder, it has been observed via simulations that having U % M{\\) does not increase the achievable information rate 

compared to when U C 7V/f(h). 



Proof - Since z E A^(h), from Q it follows that there exists a phase vector 0^ = [Ol.O^,-- - ,9]^) 

such that z = l^'' ' • Consider the phase angle vector Q"" = {91,92, - ■ ■ , 9^^) with 9-=9-+(f),i = 

1, 2, ■ ■ ■ , A^. It now follows that, ze^"^ = e^'^ ^-^ = ^-^^^ — = G A^(h). ■ 

Essentially Lemma [U shows that the set A^(h) exhibits a circular symmetry in C. The following two 

lemmas characterize M(h) and m(h). 
Lemma 2: 

v^Af IT I 111 II 

M(h) = ^-=Li ^' = i^^. (9) 



Proof - The proof essentially follows from the extended triangular inequality 

hi 



with equality achieved when e^^' = -i^, i.e., 9i = — arg(/ij). 



m(h) < — = ^ . (10) 



Lemma 3: 



Proof - Let the absolute values of the components of h be ordered as > |/ij2 1 > . . . > By 
choosing the phase angles to be 9i^ = — a,rg{hi^) for odd k and 9i^, = — (arg(/ij^) +7r) for even k, for even 
N we have Eti^ie'^' = Y.LiK^^'^'" = Ek=i (I^i2.-il - l^ij) < Ek=i - \hi,+A) = \hi,\ - 



\hij < \hi^ \ = ||h||oo. Similarly, for odd N, we have Y.^^i^iC^'^' = Efc=i ^^^"^ (I^i2fc-il - \hi2k\) 



^(7V-l)/2 



< 



Y.k=i (l^ifcl - l^ifc+il) + l^iivl = l^nl - l^iiv-il + l^iivl < I^hI = l|h||oo- The proof now follows 
from the definition of m(h). ■ 
In Appendix lAl for the i.i.d. Rayleigh fading channel, we analytically show that for any constant c > 0, 
limAr_i.oo Prob^m(h) > = 0, which essentially means that for any arbitrarily small e > 0, there 

exists a corresponding integer N{e, c) such that Prob^m(h) > ^^^^-^ j < e for all N > N{e, c). Basically, 
it means that for sufficiently large N, with very high probability m(h) < ■ Since '^''^^^ — > as 

— > oo, it follows that with increasing A^, m(h) approaches with high probability. A similar result has 
been stated in lfT4l . where it has been shown that for large A^, ||h||oo = maxj \hi\ = E[|/ij|]0(log(A^)). 
Numerical results for the i.i.d. Rayleigh fading channel have however revealed that, with increasing A^, 
m(h) goes to zero at a significantly faster rate than \og{N)/\/N (implying that the upper bound in 
(flOl) is not quite tight). We illustrate this fact in Fig. [21 where we plot the mean value of the ratio 
m(h)/M(h) and its upper bound ||h||oo/||h||i as a function of increasing A^. For i.i.d. fading channels 
where the channel gains are bounded, i.e., \hi\ < M = 1, 2, ■ ■ ■ , A^ for some constant M, it follows 



that ||h||oo is also bounded (||h||oo < M) and hence ||h||oo / V N will converge to as — )■ oo. Since, 
m(h) < ||h||oo/ VN, it immediately follows that m(h) — as — j- ooO For the single-path only 
DLOS channel with \hi\ = ■ ■ ■ = \hiy\, it can be shown that for any N >2 and any h, m(h) = 0. (With 
Oi = ^^(^ - arg(/ii), z = 1, 2, . . . , iV, it is clear that ^. hiC^^^ = 0.) 

The next theorem characterizes the set A^(h). 

Theorem 1: 

A^(h) = 1^ I ^ G C, m(h) < 1^1 < M(h)|. 



(11) 



Proof - Let 



arg mm 

eie[-TT,Tr) ,i=l;2,...,N 



N 



Consider the single variable function 

f{t)-- 



it) 



N 



t e [0, 1] 



where the functions 9i{t) , i = 1,2, N are defined as 



9,it) = {l-t)9t -taigihi) , tE [0,1]. 



(12) 



(13) 



(14) 



Note that /(t) is a differentiable function of t, and therefore it is continuous for all t G [0, 1]. Also from 



(fT2l) . Lemma [2] and ^ it follows that 

/(O) = m(h)2 , /(I) = M(h)2 



(15) 



Since f{t) is continuous, it follows that for any non-negative real number c with m(h)^ < < M(h)^, 
there exists a value of t = t' G [0, 1] such that 



Let 



fit') = c\ 



(16) 
(17) 



From the definition of A^(h) in it is clear that z' E M{h.). From ([H]), ([JT]) and it follows that 

(18) 



c. 



Therefore, we have shown that for any non-negative real number c G [m(h) , M(h)], there exists a 
complex number having modulus c and belonging to A^(h). 

Further, from Lemma [H we already know that the set Ai (h) is circularly symmetric, and therefore all 
complex numbers with modulus c belong to Ai(h). Since the choice of c G [m(h) , M(h)] was arbitrary, 
any complex number with modulus in the interval [m(h) , M(h)] belongs to A^(h). ■ 

^We conjecture that m(h) — > as W — oo even if the i.i.d. channel gain distribution has unbounded support, though we do not have a 
rigorous proof of this statement. 
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A. The proposed precoder map = 0" 

The proof of Theorem [T] is constructive and for a given u eU C Ai{h), it gives us a method to find 
the corresponding phase angle vector 9" = {9^, ■ ■ ■ , 9^) which satisfies ([5]). For a given u eU C A^(h), 
we define the function fu(t) = f{t) — \u\'^ , t G [0, 1] where f{t) is given by (fT3l) . Using Newton-type 
methods or simple brute-force enumeration, we can find a t = satisfying fu{tu) = (the existence of 
such a tu is guaranteed by the constructive proof of Theorem [T]). The phase angles which satisfy (|5]) are 
then given by 9^ = 9i{tu) + where 9i{t) is given by (fT4l) . and is given by e^'^ = n "/^g (t^.) • 

Yet another method to obtain 0" is to minimize the error norm function e"(e) = \u-^^^^hie^'>^ /^/N\'^ 
w.r.t. 6. For large N, it has been observed that, most local minima of the error norm function have small 
error norms, and therefore low-complexity methods like gradient descent can be usedO For very small 
iV = 2, 3 there exist closed-form expressions for 6" as shown belowO From the expressions for the 
phase angle vector for very small N, and the existence of low-complexity gradient-descent type methods 
for large N, it is expected that the computational complexity of the proposed CE scheme would not be 

*One method, that we have empirically found to have fast local minima convergence, is to sequentially update one phase angle at a time 

while keeping the others fixed in such a way that the objective function value e"(0) = |u — X^fli hie'^'- jyfN^ decreases with every 

update. Each update is a simple one-dimensional optimization problem, and since the convergence is fast, the order of complexity is expected 

to be the same as the MRT scheme, i.e., 0{N). 

' For iV = 2, m(h) = \\hi\ - \h2\\/V2, and M(h) = {\hi\ + |/i2|)/\/2. For any u G M{h), i.e., m(h) < lu| < M(h), the 

corresponding phase angle vector 0" = (S" ,62)'^ which satisfies ([Sj is given by 

6I2 = cos^M ^— t— ^— — —] + arg(u) - arg(/i2) , = arg ( ( ii - 



V2\u\\h2\ J ' ' . ^ V ^ 

Note that there can be two possible solutions, since cos~^(-) can take two possible values in [— tt tt). 
For iV = 3, M{h) = (|/ii| + \h2\ + \hii\)/V3, and m(h) is given by 



m(h) 



\\hi\ - |/i2|| < \hs\ < \hi\ + I/12I 



V3 

For any it G M{h), i.e., m(h) < |it| < M(h), the corresponding phase angle vector is 0" = {9i , 62 , ^3)^, with 6^ satisfying 

< COS (^3 + arg(/.3) - arg(.)) < 

Note that, S3 can take iirfinitely many values. For a chosen 6^3, let u\ = ^J%(^ ~ ''^^ ^ j . The remaining angles are then given by 

-1 / Wl? + ^ - , , , fV2f h2 ,01^ ' 
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significantly larger than the complexity of the MRT scheme when is either very small or large. 

When is neither very small nor large (typically 3 < < 10), then the value of the error norm 
function may not be small at a significant fraction of local minima, which leads to poor performance of 
the gradient descent method. We therefore propose the following two-step algorithm for small (i.e., 
3 < A^ < 10). In the first step, we find a value of = 0" such that \u — J2iLi /iie-'^''/V^p is sufficiently 
small. This step ensures that with high probability, 0" is inside the region of attraction of the global 
minimum of the error norm function. In the second step, with this = 0" = (6'", ■ ■ • , 9'^) as the initial 
vector, a simple gradient descent algorithm would then converge to the global minimum. 

The first step of the proposed algorithm is based on the Depth-First-Search (DFS) technique. Basically, 
for a given u, we start with enumerating the possible values taken by 9^ such that © is satisfied with 
0" = 0". To satisfy (|5]), it is clear that 9^ must equivalently satisfy 



Using Theorem [B this is then equivalent to {^/N/^^7^r^){u - ^^^^j^) e M{{hi, ■■■ , /itv-i)^) i-e. 



ml 



(h(^-i)) < J^L_ u - < M(h(^-i)) (20) 



A^ 



'N 

where h^^"^) = {hi, . . . ,hN-iY and m(-),M(-) are defined in For example M(h(^"^)) = 

||h^^~"^^||i/V^ — 1. Equation (|20|) gives us an admissible set IJ^j C [— tt, tt) to which 9^ must belong for 

(fT9l) to be satisfied. We call this as the k = 0-th "depth" level of the proposed DFS technique. 

Next, for a given value of 9^ G I^, we go to the next "depth" level (i.e., k = 1) and find the 
set of admissible values for 6'J^_i. Essentially, at the k-th depth level, for a given choice of values of 
{9"^, 9^_^, . . . , 6'7v-/c+i)' '^i'^h ^ I^^i^i, i = I, ■ ■ ■ , k, we solve for the set of admissible values 

for 9'^_f^ such that dS]) is satisfied with 0" = 0". From Theorem [U this set (i.e., ) is given by the 
values of 9^_i^ satisfying 



N -k-l 



N 



N -k-l 



(hl^-fc-D) < < " ' il/(h(^-'--i)) (21) 



N 



/N 

where u'^''^ = (n-^lLi and h^^"''"^) = {hi, . . . , hN-k-iY ■ If there exists no solution to 

(|2TI) (i.e., I'^^j. is empty), then the algorithm backtracks to the previous depth level i.e., k — 1, and picks 
the next possible unexplored admissible value for 9'^_^_^^-^ from the set I^_^^i- If there exists a solution 
to (|2T]) . then the algorithm simply moves to the next depth level, i.e.. A; + 1. The algorithm terminates 
once it reaches a depth level of A; = A^ — 1 with a non-empty admissible set Ii. Since u G A^(h), 



11 

the algorithm is guaranteed to terminate (by Theorem [T]). It can be shown that for depth levels less than 
k = N — 2, the admissible set is generally an infinite set (usually a union of intervals in M). Therefore, due 
to complexity reasons, at each depth level it is suggested to consider only a finite subset of values from 
the admissible set (e.g. values on a very fine grid), and terminate once the algorithm reaches a sufficiently 
high pre-defined depth level K with the current error norm \u^^^\ below a pre-defined threshold. In the 
second step, a gradient descent algorithm starting with the initial vector = {O"^, . . . , 9^_j^^i, 0, . . . , 0), 
converges to the global minimum of the error norm function e"(0). In terms of complexity, the first step 
of this two-step algorithm is expected to have a higher complexity when compared to the MRT scheme. 

IV. The Doughnut Channel 

Geometrically the set A^(h) resembles a "doughnut" in the complex plane (see Theorem [T] and Fig. |3]). 
With U C A^(h), and the precoder map in Q, we effectively have a "doughnut channel" (see dV])) 

y=y^u + w , m(h) < |n| < M(h) , w^CAf{0,a^) (22) 

which is a SISO AWGN channel where the information symbol u is constrained to belong to the 
"doughnut" set A^(h). Therefore, with U C A^(h), it is clear that the capacity of the MISO channel with 
per-antenna CE inputs is equal to the capacity of the doughnut channel in (|22|) . which is given by 

Cdonut = sup I{y; u) (23) 

Pni-) ,ueM{h) 

where I{y;u) denotes the mutual information between y and u, and Pui') is the p.d.f. of u. Due to the 
difficulty in deriving an exact expression for Cdonut. we propose an appropriate lower and upper bound. 
The upper and lower bounds presented here will be used in Section |Vl] to quantify the performance of 
the proposed CE scheme when compared to the average-only total power constrained MRT scheme. 

A. An Achievable Information Rate for the Doughnut Channel (Lower Bound on Capacity) 

For = 1, the doughnut set contracts to a circle in the complex plane. In this case, capacity is achieved 
when the input u is uniformly distributed on this circle [|71. 

For > 1, the information rate achieved with u uniformly distributed inside the doughnut set (i.e., the 

p.d.f. of u is pT^iz) = ^(A^(h)2^_^(h)2) , z G Mih)) is given by 

> log2(2'^(") + 2'^^"'/^)) - h{w/^P^) = log2(l + 2'*(")-''(^/v^)) (24) 
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where h{s) = — f Ps{z)\og2{ps{z))dz denotes the differential entropy of the r.v. s (Ps{-) denotes the 



p.d.f. of s). The inequality in (|24|) follows from the Entropy Power Inequality (EPI) [[T5B . which states 
that if y = u + V where u and v are independent random variables, it holds that 2'^^^^ > 2^*^"^ + 2'^^'"\ 



Since u is uniformly distributed inside Ai 
(|24l) . we have the following lower bounJl^ 



^h), we have h(u) = log2(vr(M(h) — m(h) )). Using this in 



Q™„, > /(,; ur' > log, (l + ^ ^m^- '-m (25a) 

Cdonut > I{y; m)™^^ > log2 (l + (using LemmasEH. (25b) 



B. An Upper Bound on the Doughnut Channel Capacity 
Let s = -^p^, and let ps(-) be its p.d.f. We now have 

l{y\ u) = I{s; u) = h{s) -h{s\u) = - [ ps{z) \og2{ps{z))dz - logs f^e^ 

= - [ Ps{z)\og2(^^4~v)^^~ [ Ps{z) ^og2{g{z))dz - hg^fTre^ 
J zee ^9{z)^ J zee ^ Pt' 

< - / Vs{z)\og.2{g{z))dz -XogA-ne—] (26) 
J zee ^ 

where g{z) is some distribution function (i.e., f^^^g{z)dz = 1). Also, for any 2; G C, g{z) > 0. 
D(Ps(-)llfl'(')) denotes the KuUback-Leibler (KL) distance between the distributions ps{ ) and g(-). The 
last inequality in ( |26l ) follows from the fact that the KL distance between any two distributions is always 
non-negative. Since (|26l ) holds for any distribution g(-), we aim to find a g(-) for which the integral 
J^^^Ps{z)log2{g{z))dz can be computed in closed-form, and which also results in a sufficiently tight 
upper bound. We propose to use g{z) = 2/3e~'^^'^^l^l^, /3 > 0. With this choice of g{z) in (|26|) . we have 



4 2 2 

liv; u) < - log2(2/3) + 7r3/32 log2(e) (M(h)^ + 2^ + ^^^(h)') " logal^re^). (27) 



With A?^ > 1, a condition that is required for the usage of EPI to be valid is that M(h) > m(h), since otherwise the set A^(h) 
has a zero Lebesgue measure leading to an undefined h{u). From Lemma [2] and [3] it follows that the condition ||h||i > ||h||oo implies 
M{h) > m(h). Since ||h||i > ||h||oo holds for any h having more than one non-zero component, the required condition is met for most 
channel fading scenarios of practical interest. 
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Minimizing this upper bound w.r.t. the free parameter (3 > gives 

The bound in (|28] ) is always valid irrespective of the distribution of u. Therefore it holds also for the 
distribution of ?i which maximizes I{y;u) subject to m G A^(h). 

Another upper bound to Cdonut is given by the capacity of a MISO channel where the per-antenna 
average-only power is constrained to be Pt/N (i.e., E[|x,jp] = Pt/N ,i = 1, . . . , N) for every channel 
realization h. We shall subsequently refer to this constraint as PAPC. The capacity of the MISO channel 



under a PAPC constraint is given by 



CPAPC = log2 (l + ^M(h)2) (29) 

It is clear that, for a given total transmit power Pt, the PAPC constraint is much less restrictive than the 
CE constraint, and therefore Cdonut < CpAPc- We finally propose the following upper bound on Cdonut 

Cdonut < I^'^ (h, ^) , /(^) (h, ^) ^ min (/« (h, 5) , CpAPc) (30) 

where I^^^ (h, j has been defined in (|28|) . 

V. On the capacity achieving input distribution for the doughnut channel 

For the i.i.d. Rayleigh fading channel, i.i.d. fading channels with bounded channel gain and the DLOS 
channel, with high probability, the inner radius of the doughnut set A^(h) shrinks to as — oo 
(see Section HIH) . This implies that, for large the doughnut channel in (|22|) is essentially a peak- 
input-amplitude only limited SISO AWGN channel, with the per-channel use peak- amplitude constraint 
|m| < M(h), i.e. 

y = y^u + w , |n| < M(h) , w^CAf{0,a^). (31) 

In the following, for large N we exploit this observation to propose a near-optimal capacity achieving 
input distribution (pui )) for the doughnut channel. 

In [[T6ll . it has been shown that, for a peak-input-amplitude only constrained SISO AWGN channel, 
capacity is achieved with channel inputs that are discrete in amplitude and uniform in phase (DAUIP). 

" For the PAPC constrained MISO channel, capacity is achieved by choosing u to be Gaussian distributed unit-energy symbols. For a 



given symbol u to be communicated, the optimal PAPC precoder transmits \/ Pt /N{h* /\hi\)u from the i-th antenna. 
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In our notation, the information symbol u G W^J^^ip, where W^J^jp = ^iLi^^j^avip^ ^^'•^ L e Z+ and 
a = (ai, 02, ■ ■ ■ , (a/ G (0 , 1] , ai < a2 < ■ ■ ■ < «l < !)• ^dauip given by 

i^DAUiP = {veC\ \v\ = aiM{h)}. (32) 

Essentially, the DAUIP alphabet set is composed of L circles in C with the l-th circle having amplitude 
azM(h), / = 1, 2, . . . , L. Furthermore, within a given circle, each point is equally likely (i.e., the phase 
is uniformly distributed). Let the probability that the information symbol u belongs to the l-th circle be 
denoted by j*; , / = 1, 2, . . . , L, J2iPi = 1- In fil6l . no closed-form expressions were given, neither for the 
capacity nor for the capacity achieving input (i.e., L, {«/} and {pi}). However, in [[T6l . numerically it 
was shown that, at low peak-SNR (i.e., low (Pr/(T^)M(h)^ in our notation), it is optimal to use a single- 
amplitude DAUIP alphabet set with L = 1 , ai = 1, whereas with increasing peak-SNR, the number of 
circles in the optimal DAUIP alphabet also increases. 

Based on the above discussion, for i.i.d. Rayleigh fading channel, i.i.d. fading channel with bounded 
channel gains and DLOS channels it can be concluded that, at large N, DAUIP inputs/alphabets are nearly 
optimal in terms of achieving the capacity of the doughnut channel/CE constrained MISO channel. In 
this paper, for a given N and Pt/o-"^, we numerically optimize the ergodic mutual information of the 
doughnut channel w.r.t. L and ai < a2 < ■ ■ ■ < < 1, i.e. 

{L\a*) = arg max EJl{y;u)] (33) 

LeZ+ ,0<ai<-<aL<l 



where, for a given (L, a), u G W^j^jp anco Pi = P2 = ■ ■ ■ = Pl = The numerical optimization in 

(l33l) can be performed off-line and therefore does not impact the online precoding complexity. 



VI. Information rate comparison 

With an average-only total transmit power constraint (ATPC), MRT with Gaussian information alphabet 
achieves the capacity of the single user Gaussian MISO channel, which is given by 

CATPC = log2(l + ||h||2^). (34) 

'^To be precise, Wp^^^jp is chosen to consist of all complex numbers having magnitude m(h) + a;(A/(h) — m(h)). This choice is 

motivated by the fact that, for finite A'^ > 1, even though m(h) is small compared to A/(h), it is not exactly 0. 

" In general, pi = P2 = ■ ■ ■ = Pl = ^/ L need not be optimal in terms of maximizing the ergodic mutual information. However, for 

the i.i.d. Rayleigh fading channel, we numerically observed that, in the practically interesting regime of low to moderate peak-SNR, it was 

optimal to have only a single circle, i.e., L = 1 (for which the trivial probability distribution is pi = 1). Also, designing practical channel 

codes for the doughnut channel would be much simpler when pi = P2 = ■ ■ ■ = PL = ^/ L. 
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Comparing ([341) with ([291) and dBO]) we have 

Cdonut < CPAPC < C'aTPC (35) 

For a desired information rate R, let the ratio of the total transmit power required under the per-antenna 
CE constraint to the total transmit power required under ATPC be referred to as^the "power gap" between 
the proposed CE precoder and the MRT precoder (denoted by -P^p'^^^(-R) ) 



gap ^-"V pMKT(^j^-^ ^-^^^ 



where P^^{R) and P^^^(_R) denote the total transmit power required by the CE scheme and the MRT 
scheme respectively, to achieve information rate R. We can similarly define the power gap between 
the proposed CE precoder and an optimal precoder operating under the R\PC constraint (denoted by 

CE PAPC 

Pgap' ). In the following, we investigate the power gap and the capacity ratios between the proposed 
CE precoder and the R\PC, MRT precoders, at low and high Pt/ct^ (results are summarized in Table H]). 

A. Information Rate Comparison at Low PrM(h)^/cr^ 

From the discussion in Section |V] we know that, at low P2-M(h)^/(T^ and large A^, a single 
amplitude DAUIP information alphabet having complex symbols of magnitude A'/(h) achieves near- 
capacity performance for the doughnut channel. Therefore, in the low PrM(h)^/(T^ regime, the capacity 
of the doughnut channel is roughly equal to that of a SISO non-fading AWGN channel (noise variance 
a^), where the input is constrained to have a constant envelope/amplitude of M(h)^/Pr, i.e., 

y = u + w , \u\ = M{h)^/P^,w r^CAf{0,a^). (37) 



The CE input constrained SISO AWGN channel in (1371) was considered by Wyner in [|71. In [|7i, it was 
shown that for an average power only constrained AWGN channel (i.e., y = u + w), using a CE input 
'instead of the capacity optimal Gaussian input) is almost information lossless for SNR = E[|mP]/(t^ < 1 
3 Hence for PrM(h)2/a-^ < 1 the capacity of the channel in ([37]) is roughly log2(l + PrMihy/a^). 
Using the capacity equivalence between the doughnut channel and the channel in (l37l) . we have 

Cdonut ~ log2(l + ^M(h)2) for ^M(h)2 < 1. (38) 

''^ In the following, we drop the argument R for notational brevity. 



''See Eq. (14) and Fig. 2 in Q. 
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Further, comparing (|38] ) with (|29l) . we finally arrive at the conclusion that at low (Pr/cr^)M(h)^ < 1 



Cdonut ~ CpAPc for ^M(h)2 < 1. (39) 



(t2 



Note that (|39l ) holds for any channel realization h. Using the capacity expressions (|34l ) and (|38l) . we can 
now conclude that, 



PCE,MRT ^ ||hf/M(h)2 = >1 for :^M(h)2 < 1. (40) 

gap II II / V ; /EikM^ ~ ^ ^ - 



It is therefore clear that, at low SNR the power gap will be small when the channel gains from each 
antenna are similar in magnitude, and the power gap can be large when there is a large variation in the 

CE MRT 

channel gains. For the single-path DLOS channel with \hi\ = ■ ■ ■ = \hN\, -Pgap' ~ 1 for any h. For 
i.i.d. Rayleigh fading channel and i.i.d. fading channels with bounded channel gains, using the law of 
large numbers and the Slutsky's Theorem, it can be shown that as — oo 

pCE.MRT ^ N , Hl'T'tl \ ..IS, 

where — j-p denotes convergence in probability (w.r.t. the distribution of h). For i.i.d. Rayleigh fading, this 
asymptotic (in N) power gap limit is 10\og^Q(E[\h,\^]/ {E[\h,\]Y) = 1.05 dB. 

As an illustrative numerical example, for the i.i.d. Rayleigh fading channel (with CJ\f{0, 1) distributed 
channel gains), in Figs. |4] and [51 we plot the ergodic information rate achieved under the ATPC, R\PC 
and CE input constraints for = 4 and = 64 respectively (as a function of Py/cr^). In both figures, for 
the proposed CE precoder with a DAUIP alphabet, we plot the ergodic information rate for different fixed 
values of L (i.e., L is fixed and does not change with Pt/ct^ or with h). For a fixed L and a given Pt/o-"^, 
we numerically maximize the achievable ergodic information rate as a function of a = (ai, ■ ■ ■ , ol)- Note 
that a only varies with Pt/ct'^, and does not vary with h. For the special case of L = 1, we always choose 
«! = 1. From the figures, it can be observed that, indeed at low PrMlh)"^ / a'^ < 1 (corresponding to 
achievable rates log2(l + PrMih)'^ / a'^) < 1 bpcu), as discussed previously, the information rate achieved 
by the proposed CE precoder with a single amplitude DAUIP information alphabet (L = 1) equals 
the MISO capacity under PAPC. This confirms (l39l) . and also shows that the single amplitude DAUIP 
information alphabet (L = 1) is near-optimal for the proposed CE precoder at low P^M (h)^/^^ < 1. Note 
that at low PrMih)'^ /a"^, the ergodic information rate achieved with an information alphabet uniformly 
distributed inside the doughnut set, is strictly sub-optimal. Also, at low PTM{h)'^ /a"^ < 1, the power gap 
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of the proposed CE precoder (DAUIP, L = 1) from the ATPC constrained MRT precoder is about 1.1 dB 
(close to the asymptotic power gap limit of 1.05 dB, see (|4T]) ). Note that, even with small = 4, the 
CE-MRT power gap is close to the asymptotic limit. 

Note that at low Pr/cr^, we have Catpc = log2(l + {PT/(r^)\\h\\^) ^ (Pt/ct^) ||hf Iog2(e). Similarly, 
Cdonut ~ (-Pr/o"^)^^(h)^ log2(e). Therefore for low Pt/o-"^, we have 

^ - ^ for ^Mihf « 1. (42) 
Catpc ||h||^ cr^ 

which converges to (E[|/ij|])^/E[|/ijp] as N oo for the i.i.d. Rayleigh fading channel and i.i.d. fading 

channels with bounded channel gains. For the single-path DLOS channel this ratio is 1, i.e., per-antenna 

CE transmission is optimal even under ATPC. For the special case of = 1, from fj\ it follows that at 

low Prlhl'^/a^ < 1, Cdonut ~ Catpc- 

B. Information Rate Comparison at High PjnM(h)^/o"^ 

In this section we derive lower and upper bounds to the CE-MRT power gap at high PrM(h)^/cr^. 
Using the upper bound to the doughnut channel capacity in (|28l) . it follows that in the asymptotic power 
limit as PTM{hf/a^ — )■ oo, the CE-MRT power gap is lower bounded as 

pCE,MRT > ^IW f ^M(h)^ > 1 (43) 
gap - M{h)^ (j2 y > ^ ^•^> 

For single-path DLOS channels, this lower bound on the CE-MRT power gap equals 3 dB, while 
for the i.i.d. Rayleigh fading channel and i.i.d. channels with bounded channel gains, it converges to 
2E[|/ijp]/(E[|/ii|])^ as A^ oo (this is 4.06 dB for i.i.d. Rayleigh fading channel). Another interesting 
fact is that, at high ■^M(h)^, comparing the doughnut channel capacity upper bound in (l28l) and the 



PAPC capacity in (1291) reveals that for any channel realization h and any A^, 



Pt 



^gap' ^ 2 for ^M(h)' > 1. (44) 

We now obtain an upper bound on the CE-MRT power gap. Using (|25a| ) and (|34l) it follows that, for any 
PrM(h)^/cr^ (not necessarily high), the CE-MRT power gap can be upper bounded as 



N 

For a single-path only DLOS channel with = . . . = |/iAr|, for any h and any A^ > 1 it can be shown 
that 1/k = e (since m(h) = for A^ > 1). With i.i.d. Rayleigh fading and i.i.d. fading channels with 
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bounded channel gains, as — )■ oo, using the law of large numbers and Slutsky's Theorem along with 
the fact that m(h) — )■ as — oo (see Section UlI)) . 

K {E[\h,\])^ / eE[\h,\^] (46) 

Therefore, for the i.i.d. Rayleigh fading channel and i.i.d. fading channels with bounded channel gains, 
in the asymptotic limit as — )• oo, combining (l43l) . (l45l) and (l46l) we have, 



< pCE,MRT < ^OML fiv»l :^M(h)2»l') (47) 

Therefore, with sufficiently large and high PTM{hy /a'^, the difference between the upper and the 
lower bounds on the CE-MRT power gap is 10 log]^o(^/2) = 1-33 dB irrespective of the channel fading 
distribution (as long as the channel gains are bounded). For i.i.d. Rayleigh fading, the asymptotic upper 



and lower bounds on the CE-MRT power gap are 5.4 and 4.1 dB respectively, see Fig 



For the 



practically interesting low to moderate P^Af (h)2/cr^ regime, with DAUIP alphabets the CE-MRT power 
gap is usually lesser than its asymptotic lower bound. We illustrate this fact through Fig. [6l where we 
plot the ergodic information rate as a function of increasing Pt/o-"^ for the MRT and the proposed CE 
precoder (i.i.d. Rayleigh fading channel). The reported ergodic rate for the proposed CE precoder is with 
the proposed best DAUIP information alphabet in (l33l) . It can be seen that with a properly chosen DAUIP 
information alphabet, the CE-MRT power gap is roughly 3.5 dB for a desired information rate of 3 bpcu. 
Also, the CE-MRT power gap is small even for N = 2, which makes CE transmission possible for 
conventional TX with few antennas. 

We now investigate the ratio Cdonut/C'ATPC at high PTM{hy /a"^. For A^ = 1, it is known that, at large 
Prlhil'^ / cr'^ (i.e., large Catpc). capacity with a CE input is roughly half of the channel capacity under 
ATPC ItZj. This fact is illustrated in Fig.[6l where, for A^ = 1 the channel capacity under CE transmission 
has a much smaller slope w.r.t. Pt/ct'^ as compared to the slope of the channel capacity under ATPC. For 
A^ > 1, using (l25al) . (l34l) and ([35]) it can be shown that 

^ > 7^ > —p; ^ ^ - -7^ • <.48) 

(-'ATPC CvATPC tvATPC 

In Fig. |5] note that the power gap lower bound at a desired information rate of 3 bpcu is only 2 dB, as compared to the power gap 
lower bound limit of 4.1 dB. This is because, for a desired rate of 3 bpcu, the corresponding PTM{h)^ /a^ is still not high enough for the 
asymptotic lower bound in ( 147b to be valid. A stronger result which can be seen by comparing ( 143 1 and ( 145b is that, for channels where 
m(h) as N oo, at high PTM{h)^ /a^ the upper to lower bound gap is 1.33 dB for any h (not limited to i.i.d. fading). 
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For the i.i.d. Rayleigh fading channel and i.i.d. fading channels with bounded channel gains, the 
convergence in (|46|) implies that, for any arbitrary e > 0, there exists an integer A^(e) such that with 
> A^(e), the probability that a channel realization will have a value of n > eg^jj'^'.p] — e is greater than 
1 — e. For single-path DLOS channels we already know that k = 1/e for > 1. Compared to = 1, 
with ^ 1 and high P^M (h)^/cr^, from (|48l) it follows that CE transmission can achieve an information 
rate close to the capacity Catpc under ATPC, since 1 — ^°c^pQ close to 1 (as Catpc is large, and k 
is greater than a positive constant with high probability), i.e. 



for Ar»l , 
Catpc cr 



1 for A^ > 1 , ^M(h)2 > 1. (49) 



This fact is illustrated through Fig. |4] and Fig. [51 where it can be seen that for both A^ = 4 and A^ = 64, 
the slope of the ergodic information rate achieved with per- antenna CE transmission (with information 
symbols uniformly distributed inside the doughnut set) is the same as the slope of the ergodic channel 
capacity under ATPC. Similar observations can be made from Fig. [6] with DAUIP alphabets. The intuitive 
reasoning for this observation is as follows. For A^ = 1, the doughnut set is a circle in the complex plane, 
due to which information symbols have the same amplitude and differ from each other only in the phase 
(i.e., they exploit only one degree of freedom for information transmission). In contrast, with A^ > 1, the 
doughnut set includes all complex numbers with amplitude in the range [m(h) , M(h)], which implies 
that information symbols can vary in both phase and amplitude (exploiting both degrees of freedom). 

VIL Achievable Array Power Gain 

For a desired rate R and a given precoding scheme, with A^ antennas, the array power gain achieved 
by this scheme is defined to be the factor of reduction in the total transmit power required to achieve a 
fixed rate of R bpcu, when the number of TX antennas is increased from 1 to A^. Under ATPC, with A^ 



antennas the MRT precoder achieves an array power gain of (using (1341) ) 



I/) |2 

GMRT(^) = ^^=1 l^-l (50) 



\hi\ 

which is 0{N) for i.i.d. fading and DLOS. With CE transmission, using the R.H.S of (I25al) as the 
achievable information rate, the array power gain achieved with A^ antennas is given by 

1^) = ^^ 7, r- — ; f (51) 

2 {{ Ell 1/^.1/2} -(|/ii|-|/^2|)V4} 

where G^^{R) is the array power gain achieved with only 2 antennas and depends only on hi and h2. 

From (|5TI) . it is clear that G'^{R) is 0{N) for i.i.d. Rayleigh fading, i.i.d. fading with bounded channel 
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gains and DLOS (for i.i.d. Rayleigh fading and i.i.d. fading with bounded channel gains, \ hi\/N — j-p 
E[|/ii|] and m(h)^/A^ — >p as — )• oo). Therefore, for practical fading scenarios like i.i.d. Rayleigh 
fading, i.i.d. fading with bounded channel gains and DLOS, an 0{N) array power gain can indeed be 
achieved even with per-antenna CE transmission. 

This conclusion is validated in Fig. |71 where we plot the minimum Pr/o"^ required by the CE, MKT, 
and the PAPC precoder to achieve an ergodic information rate of i? = 3 bpcu. For all precoders, it is 
observed that, at sufficiently large N, the required Pt/o''^ reduces by roughly 3 dB with every doubling 
in the number of TX antennas. This confirms the fact that, an 0{N) array power gain can be achieved 
even with per-antenna CE transmission. The minimum required PT/cr"^ is also tabulated in Table HIl 



In scenarios where the channel coherence time is much longer than the end-to-end delay requirements 
and where a constant data throughput rate is desired, we are faced with the possibility of an outage, 
wherein the channel capacity is less than the desired information rate. The outage probability under 



ATPC is defined as P^^^'^iR, Pt/ a"^) = Prob(CATPC < R) = Prob(||hf < (2^ - l)(r^/PT) where R is 



the desired constant information rate. To have a low outage probability, one needs to increase the total 
transmit power P^. With large N, due to the increased degrees of freedom in the r.v. ||hp (x^ distributed 
with 2N degrees of freedom for i.i.d. Rayleigh distributed channel gains) it is clear that, under ATPC 
the slope of the outage probability for the MISO channel w.r.t. Prjo'^ increases with increasing N (on 
a log-log plot this slope in the asymptotic limit of Pt/ct'^ — ?• oo is commonly known as the "diversity" 
order). Further, a higher slope at large N implies that less extra Pt would be required to achieve a fixed 
decrease in the desired outage probability. However, it is not clear, as to whether the above conclusion is 
valid even under per-antenna CE transmission. 

Using the proposed upper and lower bound to Cdonut (see Sections IIV-AI and IIV-BI) we can derive lower 
and upper bounds to the outage probability of the proposed CE precoder. The outage probability of the 
proposed CE precoder is given by 



VIIL Outage probability under per-antenna CE transmission 



P^^iR,PT/a') 



A 



Prob(Cdonut < R) 




(52) 
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where the second inequality follows from the upper bound to Cdonut in (l30l ). since {I^^^i \ h, ^ j < R} 
implies that {Cdonut < R}- Similarly, by using the lower bound to Cdonut in (125 al) we get the following 
upper bound on P^^(i?, Pr/cr'^) 

P^^(R,Pr/.') < Prob(log, (l + ^Ml^li^I^M) < «) (53) 



The diversity order achieved is defined as 



^og{P^^{R.PT/a')) 



dZ = ;im '"""7p";vr " (54) 



In Appendix\B\we analytically show that d^^f > {N — 1) for the i.i.d. CA/'(0, 1) Rayleigh fading channel. 
This result is tight for large A^, since the maximum achievable diversity order is A^. 

We support the above conclusion through Fig. [8l where we plot the upper and lower bounds on the 
outage probability of the proposed CE precoder as a function of Pr/cr'^ for = 2, 4, 16, 64 (i.i.d. Rayleigh 
fading). The bounds on the right hand side of (l52l) and (l53l) . have been computed through simulations. 
It can be seen that for a constant desired rate of i? = 2 bpcu, the slope of the outage probability curves 
increase with increasing A^. 

IX. Overall Improvement in Power Efficiency by using CE Transmission 

On one hand, with CE transmission we improve the power efficiency by enabling the use of highly 
power-efficient amplifiers, but at the same time, restricting the per-antenna channel inputs to CE (since 
highly power-efficient amplifiers are generally non-linear) requires extra transmit power (compared to 
Gaussian inputs) to achieve a fixed desired information rate. If this extra transmit power is significantly 
smaller than the improvement in power efficiency gained by using highly power-efficient amplifiers, then 
it is clear that using per-antenna CE transmission will lead to an overall gain in power efficiency. 

Motivated by the above discussion, for a TX with A^ antennas, compared to using highly linear and 
power-inefficient amplifiers with Gaussian inputs (MRT precoder), the overall gain in power efficiency by 
using highly power-efficient amplifiers with per-antenna CE inputs is given by p = ^'pat?""^^"^^'^ / -Pgap '^^^ 
where PAEj^.jjjjgj^j. and PAEJj„gJ^. denote the power-efficiency of non-linear and linear power amplifiers 



respectively^ For a highly linear power amplifier, PAEjjjjgj^j. ^ 0.15 — 0.25, whereas a highly power- 
efficient but non-linear amplifier has PAE„o„.jj„gjy. ^ 0.7 — 0.85 ifTTl . As an illustrative example, with 

"For an RF power amplifier, the power efficiency is the ratio of the total RF power radiated to the total amplifier input power 
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PAEijjjg^j. = 0.2 and PAEjjQjj.jjjjg^r = 0-8' using analytical results on Pg^'^^^ (see Section |VIl), it follows 
that in single-path DLOS and i.i.d. Rayleigh fading channels it is indeed beneficial to use per-antenna 
CE inputs with highly power-efficient amplifiers (Pg^'^^^ < 4 (6dB) implies that p > 1). At practically 
interesting low to moderate values of P^Af (h)^/cr^, for i.i.d. Rayleigh fading channels p varies from 4.95 
dB (at rates below 1 bpcu) to 2.5 dB (at an information rate of 3 bpcu). 

X. Conclusions and Future Work 

In this paper, we derived an achievable rate for a single-user Gaussian MISO channel under the constraint 
that the signal transmitted from each antenna has a constant envelope. We showed that for i.i.d. Rayleigh 
fading channels, i.i.d. fading channels with bounded channel gains and DLOS channels, even with the 
stringent per-antenna CE constraint, an 0{N) array power gain can be achieved with antennas. Also, 
compared to the average-only total transmit power constrained channel, the extra total transmit power 
required under the CE constraint to achieve a desired rate (i.e., power gap), is shown to be bounded and 
small. We conjecture that these results hold true for a much broader class of fading channels, and are not 
limited to i.i.d. Rayleigh fading, i.i.d. fading channels with bounded channel gains and DLOS channels. 
We are currently extending the results in this paper to the multi-user setting, see [[T8l . 
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Before discussing the main result, we make some definitions. For a random channel vector h = 
{hi, /i2, ■ ■ ■ ) , let Zi = \hi\^. Further, let , z = 1, 2, ■ ■ ■ , be defined to be the i-th smallest 
value among Zi, ■ ■ ■ , Z^- Therefore, we have < < Z{2) < ■ ■ ■ < Z^^^ < oo. 

Theorem 2: For an i.i.d. CA/'(0, 1) Rayleigh fading channel, for any constant c > 



Appendix A 



On the order of m(h) as iV oo 




(55) 



where m(h) has been defined in ([8]). 



Proof - It suffices to prove that 




1 



(56) 



Further, since m(h) < 



maxi^i^,.,^jv \hi\ 



(Lemma [3]), it suffices to show that 




1 



(57) 
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In terms of the newly defined random variables above, this is equivalent to proving that 

lim Pioh(Z(N) < c'^log^N] = 1 (58) 

Due to i.i.d. CJ\f{0, 1) Rayleigh fading, the random variables Zi,i = 1, 2, ■ ■ ■ , N are i.i.d. exponentially 
distributed with mean value 1. Therefore 

N N 
Prob(^Z(7v) < C^log^A^) = JJProb(^Zi < log^ A^) = H 



i=l i=l 
^2 i„„2 



., ; , N / 1 \ N 

1 - ^-""-' l = (l - W^) ■ (59) 
We next show that 

jhn log Prob < log^ n) = lun^ N log (l - = (60) 

from which (l58l) follows immediately. To prove ([601), note that for any c> and all N > 2, ATc^iogA^ ^ ^ 
Further, using the inequality log(l — x) < —x for < a; < 1 [[T9ll , for > 2 we have 

^iQg 0- jv^^iogiv ) ^ - ^c^Togiv (61) 

Using (|6T1) we have 

lim A^logfl ^^-T7) < - lim — = 0. (62) 

Using the inequality log(l — x) > —x/{l — x) for < x < 1 |fT9l , for > 2 we have 

A^log (^1 - ^c2iogyv) - ~ jycnogAf -j^ _ g-c2log2Ar ^^^^ 

which implies that 

lim iVlogfl > - hm — — \ — ^ = 0. (64) 



Combining (1641) and (1621) proves (1601) which completes the proof. 

Appendix B 

Diversity analysis for the Outage probability of the proposed CE precoder 
Using the lower bound on Cdonut in (I25bl) . an upper bound on the outage probability is given by 

P^^,{R,PT/a^) = Prob(Cdonut < i?) < Prob(log2 (l + ^^^^^^^^) < i?) (65) 
In terms of the new random variables defined at the beginning of Appendix |Al we have 

TV 2 

l|h||?-||h||^ = (5^v^) -V) > 5^%). (66) 



i=l i=l 
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Using this fact in (|65l) . we have 



P^^{R,Pr/a') < Prob(log,(l + -| ^-=^^ ^'^ ) < r) (67) 

Let us define random variables 

Yi = (iV-^ + 1) (%-%_!)) ^ = 1,2,- ■■ ,iV. (68) 

Note that Yi = NZ(i). For the i.i.d. CA/'(0, 1) Rayleigh fading channel, it is known that G [0 , oo) , i = 
1, 2, ■ ■ ■ ,N are i.i.d. exponentially distributed random variables with mean 1 (see section 2.7, page 17 in 



201 ). From the definition above, it immediately follows that 

N-l N-l 

^ = Z(jv-i) + ^(0 (69) 



i=l i=l 



which implies that " 



1=1 i=l 

since Yi and are non-negative random variables. Using (TTOl ) in (|67l) we have 

pS(R.PtIo') < Prob(log,(l + 5^^)<fl) = Prcb(|:5^.<M|lzi)),7,) 

4 = 1 ' 



Since, the event ^'.J^' Yi < p-'i'^ ^ impUes that each Yi < p^)j we further have 

(/?, Pr/a^) < Prob(F, < ^""^^^^^7 , ^ = 1, 2, ■ ■ ■ , iV - l) (72) 
Since 1^ are i.i.d. exponentially distributed, the right hand side in the above can be further simplified to 
PS(R.Pr/<r^) < n Prob(r. < ^ (l - e-*)"- (73, 

2 = 1 

The diversity order achieved by the outage probability therefore satisfies 



where we have used (l73l) for the inequality. Using the identity 



lim Ml^f:!!) = 1 (e > 0) (75) 

a: -5-0 log X 



with X = 0-7 Pr and c = 2eiV(2^ - 1) > 0, we have 



log ) 

which then proves that 



- log 1 - e ~fW^ 
lim ^ 7 — r = 1 (76) 



c^out > {N-l) {11) 
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TABLE I 

Capacity ratios/power gap of CE transmission w.r.t. MRT and PAPC transmission 





iV> 1 


N = 1 




|fAf(h)2 < 1 


If Af (h)2 » 1 


^\hi\'«l 


^\hi\'»l 




i.i.d. Rayleigh fading, 

i.i.d. fading channels 

with bounded channel gains 


DLOS 


i.i.d. Rayleigh fading, 

i.i.d. fading channels 

with bounded channel gains 


DLOS 






pCE.MRT 
(dB) 


10 lo- ( ■^'I'^'I'l ) 





>3 + 101ogio((||7g.) 
<4.3 + 101ogio((iS^) 


> 3 
< 4.3 





CO 


pCE.PAPC 
(dB) 








> 3 
< 4.3 


> 3 
< 4.3 





oo 


'^donut 


(E iH ]r 


1 


1 
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<^atpc 




"-^donut 
'^PAPC 
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1 
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TABLE II 

Signal-to-noise-ratio Pt/(t'^ (dB) required to achieve an ergodic rate of 3 bpcu (i.i.d. 

CA/'(0, 1) Rayleigh fading) 





N=l 


N=2 


N=3 


N=4 


N=8 


N=16 


N = 32 


N = 64 


MRT (ATPC) 


10.2 


6.4 


4.3 


2.9 


-0.4 


-3.5 


-6.5 


-9.5 


PAPC 


10.2 


6.9 


5.0 


3.7 


0.6 


-2.5 


-5.5 


-8.6 


CE (best DAUIP) 


14.3 


9.8 


7.6 


6.2 


3.1 





-3.0 


-6.0 


CE (UNIF) 


14.3 


10.4 


9.0 


8.2 


5.0 


1.8 


-1.3 


-4.4 



27 




(a) MRT (b) CE 

Fig. 1. Maximum Ratio Transmission (MRT) versus per-antenna Constant Envelope (CE) constrained transmission, for a given average 
total transmit power constraint of Pt- h — {hi, • • • , Hn)'^ is the vector of complex channel gains. 




Fig. 2. Mean value of the ratio m(h)/Af(h) as a function of increasing A'', for i.i.d. CN{Q, 1) Rayleigh fading. We calculate m(h) in 
^ using an iterative gradient descent type method. 
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8.3.! 

J3 



. MISO Capacity (ATPC) 
. MISO Capacity (PARC) 
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Fig. 4. Ergodic information rate vs. Pt/(t^, for i.i.d. CAf{0, 1) Rayleigh fading and A'' = 4. 
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P/a (dB) 



Fig. 5. Ergodic information rate vs. Pt/o^ for i.i.d. CA/'(0, 1) Rayleigii fading and A'^ — 64. 
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^- N = 1 , CE Capacity (Wyner) 
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Fig. 6. Ergodic information rate vs. Pt/o-^ for i.i.d. CA/'(0, 1) Rayleighi fading and A'^ = 1, 2, 4, 16. 
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Fig. 7. Minimum Pr/cr^ required to achieve an ergodic information rate of 3 bpcu as a function of tlie number of antennas A^, for i.i.d. 
CA/'(0, 1) Rayleiglr fading. 
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Fig. 8. Upper and lower bounds on the outage probability of the proposed CE precoder at rate 2 bpcu, as a function of Pt/o"^, for i.i.d. 
CA/'(0, 1) Rayleigh fading. 



